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Abstract 

We propose an approximation of the asymptotic variance that 
removes a certain discontinuity in the usual formula for the raw and 
the smoothed periodogram in case a data taper is used. It is based 
on an approximation of the covariance of the (tapered) periodogram 
at two arbitrary frequencies. Exact computations of the variances 
for a Gaussian white noise and an AR(4) process show that the 
approximation is more accurate than the usual formula. 

Key words: Asymptotic variance, data taper, (smoothed) pe- 
riodogram. 



1 Introduction 



Spectral estimation is by now a stan dard topic in time ser i es ana lysis , and many 



excell ent books are available, e.g. iPercival and Waldenl (|1993l ) or iBloomfield 



( 2000l ) . The purpose of this short note is to propose an approximation of the 



asymptotic variance that removes a certain discontinuity in the usual formula 
for the raw and the smoothed periodogram in case a taper is used. The standard 
asymptotic variance of the raw peri odogram is independent of t he taper chosen, 
see Formulae (222b) and (223c) in Percival and Walden ( 19931 ). However, this 



changes when the raw periodogram is smoothed over frequencies close by. Then 
a variance inflation factor Ch, see (HJ), appears which is equ al to one if no taper is 



used a nd greater than one otherwise, compare Table 248 in lPercival and Walden 



(1993). The reason for this is that tapering introduces correlations between the 



raw periodogram at different Fourier frequencies. Because of this, the variance 
reduction due to smoothing is smaller in the case of no tapering. The above 
variance inflation factor is justified asymptotically when the number of Fourier 
frequencies that are involved in the smoothing tends to infinity (more slowly 
than the number of observations, otherwise we would have a bias). Hence, 
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if only little smoothing is used, then we expect something in between: some 
increase in the variance, but less than the asymptotic variance inflation factor 
Ch- We give here a formula, see ([5]), which is almost as simple as the inflation 
factor, but which takes the amount of smoothing into account. 



2 Notation and preliminaries 

Let {Xt}tez be a real- valued stationary process with observation frequency 
1/A, mean EpQ] = fj,, autocovariances s T := Cov(Xt, Xt+ T ) an d spectral den- 
sity S(f). We assume that X\, X2, ■ ■ ■ ,Xn have been observed. The t apere d 
periodogram (called direct spectral estimator in Percival and Walden ( 19931 )) 
is 

A 



S(tp)(f) 



N 2 

t=i 



for / € [0, 1/(2 A)]. Here the estimator ji is usually either the arithmetic mean 
X or the weighted average (£)t=i htX t ) / (YltLi ht)- The latter has the property 
that 5^(0) = 0. Since the choice is irrelevant for the asymptotics, we can use 
either version. The taper (hi, . . . , hp?) is chosen to reduce the discontinuities of 
the observation window at the edges t = 1 and t = N. Usually, it has the form 
ht = h ((2t — l)/(2iV)) with a function h that is independent of the sample size 
N. A popular choice is the split cosine taper 

{i(l-cos(2vrx/p)) 0<x<f ] 

1 |<s<l-f \. (1) 

\{l - cos(2vr(l - x)/p)) l-f<x<l J 

The tapered periodogram has the approximate variance 

Var[^)(/)]«S(/) 2 , /^{0,1/(2A)} (2) 



(see e.g. Percival and Walden ( 19931 ). Formula (222b)). In particular, it does 



not converge to zero. Because of this, one usually smoothes the periodogram 
over a small band of neighboring frequencies. We smooth discretely over an 
equidistant grid of frequencies. Let /j\r',fc = k/(N'A) (0 < k < N'/2) for an 
integer N' of order O(N). Then the tapered and smoothed spectral estimate is 

M 

s {ts) (fN>,k)= £ g j &*HfN>,k- j ), 

j=-M 

where the gj's are weights with the properties gj > 0, gj = g-j (— M < j < M) 
and Ylj=-M 9j = ^ k < M, the smoothing includes the value 5^(0) which 
is equal or very close to zero if the mean \x is estimated. In this case, we should 
exclude j = k from the sum. 
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3 Approximations of the variance of spectral estima- 
tors 



The usual approximation for the relative variance of S^ s '(f^r j.) is 



Var 



S (fa) (/Jv',fc) ' 

S(fN'k) 



c, 



M 



N' 



N ^ ■ 

r=-M 



(3) 



for k ^ 0, N'/2 where 



C h 



Eli K/n 



(4) 



This formula is given in iBloomfieldl (|2000h . equation (9.12) on p. 183, and 
it is impleme nted in the function "spec.pgram" in the language for statistical 
computing R (|R Development Core Team! (120101) ) . In o rder to see that it is the 
same as Formula (248a) in Percival and Walden ( 19931 ). one has to go back to 
the definition of W m in terms of the weights gj which is given by the formulae 
(237c), (238d) and (238e). If we put M = 0, © is different from ©. The reason 
for this difference is that ([3]) is valid in the limit M — > oo and M/N' — > 0. But 
in applications M is often small, e.g. M = 1, and one wonders how good the 
approximation is in such a case. 

We propose here as alternative the following approximation for the relative 
variance 



2M \Hi N) (f NI>l )\ M-l 



E ^ 2 + 2 E 



-M 



1=1 



E 9r9i 



+1 



(5) 



-M 



(again for k ^ 0,N'/2) where Hf\f) = i ^ e~ i2ntfA . In order to 
compute this expression, we need to compute the convolution of the weights 
(gj) and the discrete Fourier transform of the squared taper. The former is 
usually not a problem since M is substantially smaller than N'. Using the fast 
Fourier transform, exact computation of the latter is in most cases also possible. 
If not, then by the Lemma below we can use 



<>(/) 



h 2 



u e 



-i2irNu 



u e 



-iwfA_ 



vr/A 



sin(7r/A) ' 



Choosing a simple form for the function h, we can compute the integral on the 
right exactly. It is obvious that © agrees with ([2]) for M = 0. In the next 
section, we show that it also agrees with ([3]) for M large. 



4 Justification of the approximation 

The idea is simple: We just plug in a suitable approximation for the relative 
covariances of the tapered periodogram values into the exact expression for the 
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relative variance. Var [S^ ts \fiy',k)/S(fN',k)j is equal to 



M M 
•=-M s=-M 



N'k- 



S(fN',k-a) 



(6) 



The asymptotic behavior of these covariances is well known. Theorem 5.2.8 of 



Brillingerl (|1975l ) shows that, under suitable conditions, we have for frequencies 



< / < g < 1/(2 A) that 
Cov 



{ S(f) ' S(g) ) 



Hi N \f-g) + Hf>(f + g) 



<)(0) 



+ 0(N~ 1 ). (7) 



The statement in iBrillingerl (|1975l ) is actually asymmetric in / and g since it 
has S(f) instead of S(g) on the left side in the equation above. Our statement 
can be proved by the same argument if we assume S(f) S(g) when \f — g\ is 
small. When \f — g\ is big, i.e. not of order 0(A^ _1 ), the covariance is of the 
order 0(N^ 1 ) anyhow. Using the approximation ([7]) directly would lead to an 
approximation which depends on k. Having to compute N'/2 different approx- 
imate variances is usually too complicated. However, the term \H^\f + g)\ 2 is 
small unless A(f + g) is close to zero modulo one. This ha s been pointed out by 
Thomson! (|l977h . see also the discussion on p. 230-231 of iPer rival and Waldenl 



(|1993l ). If we omit this term, then we obtain our new approximation ([5]) by a 
simple change in the summation indices. 

We next give a simple lemma that justifies the omission of the second term in 
([7]). In addition, it also shows how the usual approximation ([3]) follows from 
©• 

Lemma 4.1. If ip is once continuously differentiable on [0, 1] and ifj' is Lipschitz 
continuous with constant L, then 



1 N 



N 



t=i 



2t - 1 
2N 



Jo 



du e 



-iir\ 



VTA 



sin(7rA) 



R 



where \R\ < const. /N uniformly for all A G [0,0.5]. 

Proof. Put e = l/(2A r ). By a Taylor expansion, we obtain for any x G [0, 1] 

' ^(u)e- i2nNXu du = iP{x)e- i27rNXx 
+ ^(x)e- i2 * NXx 



ue 



-i2wN\u 



du + R' 



where the remainder satisfies \Rl\ < 2e 3 L/3 = L/(12A r3 ). Next, observe that 



-i2irNXu 



du 



sin(-7rA) 
ttXN 



tic 



-i2nN\u 



du 



2tt\N 2 



cos(7rA) 



sin(7rA) 
7rA 



From this the lemma follows by taking x = (2t — 1)/(2N) for t = 1, . . . ,N and 
summing up all terms. □ 
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li ip(0) = ip(l) = 0, then by partial integration 



-i2ttN\u 



du 



< 



sup ^'(a;)! 



Hence by setting ip(u) = h?(u), we obtain 



H\ '(f) < const.(NfA)- 1 + const.A^ 1 < const. (Nf A)" 1 (8) 

for / < 1/A. Therefore the second term in (T7]) is negligible unless / + g is of 
the order 0(iV _1 ). 

Finally, we derive the usual variance approximation ([3]) from (0) as follows. By 
Parseval's theorem 



N'/2 



E KW-££E* 



l=-N'/2 
r( N )fn\ _ 1 /AT V-A 1,2 



2 N' 1 



A? 



TV iV 



t=i 



Note that ^ ; (0) = 1/N ■ J2t=i h l Because of ©, we have 



JV'/2 



E KW< E 



2 ^ /const.- iV- 2 



V iVZA 



!=2M+1 l=2M+l 

for M oo and JV' = 0(JV) -> oo. Thus 

2M 9 AT' 1 N 

E M-iiSf^ 

l=-2M t=l 

also in the above limit. If the weights (fy- change smoothly as a function g of 
the lag j, i.e., gj = g(j/M), then for any fixed I 

M-l „! M 

E 5r5r+i ~ M / g 2 (u)du ~ E 9r ( M ->■ °°) 
r=-Af 1 r=-M 

and the desired result follows by dominated convergence. 



5 Comparison with exact relative variances for Gaus- 
sian processes 



If we assume the process {X t } t £z to be Gaussian, then it holds 



Cov 



\ s(f) ' s( 9 ) J sunaXZLh^ 



N 



E h 3 h kS] . k e-^-^ 

j,k=l 



N 



j,k=i 
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sec p. 326 of IPercival and Waldenl f|l993l ). Plugging this into ([6]) yields thus an 
exact expression. Evaluation is of the order 0(iV 3 ), thus it is not practical to 
use it routinely. 

We now compare the two approximations to the exact relative variances for a 
Gaussian white noise Xt = et and the AR(4) process 



X t = 2.7607AVi - 3.8106AV 2 + 2.6535A", 



t-3 



0.9238AV4 + e t 



(9) 



used in Percival and Walden ( 19931 ) (see p. 46) where et i.i.d ~ A/"(0, 1). 
True spectra are shown in Figure Q] in decibel (dB), i.e., the plot displays 
10 log 10 (5'(.)). As we can see, the spectrum of the AR(4) process varies over a 
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Figure 1: Spectra of the Gaussian white noise and the AR(4) process in (|9|) in 
dB. 

wide range and exhibits two sharp peaks. Further, we assume the observation 
frequency 1/A to be 1 and N = 2 10 = 1024. We compute the exact rela- 
tive variance Q at the frequencies fk,N'i k = 0,. . . , N'/2, for N' G {N, 2N}, 
the split cosine taper ([1]) with p G {0.2,0.5} and weights gj = 1/(2M + 1), 
j = —M, . . . , M, with M G {0, 1,2}. Comparison to the usual approximation 
d3]) and to the new one (0) is shown in Figure [2J The code in R is available un- 
der http : //stat.ethz.ch/ ~ kuensch/papers/approximate_variances.R. 
We see that the new approximation fits the true relative variances clearly bet- 
ter when we smooth over few frequencies, i.e., M is small. Especially in the 
situations when the data is strongly tapered (p = 0.5) or when we use a refined 
smoothing grid (N' = 2N) we recommend to use the new approximation ([5]). 
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Figure 2: Exact relative variances of the Gaussian white noise (thick- 
dashed) and the AR(4) process (thick-solid) in comparison to the usual 
(thin-dashed) and the new (thin-solid) approximation for different choices 
of N', p and M. 
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